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Abstract 

We generalize the notion of quasirandom which concerns a class of 

equivalent properties that random graphs satisfy. We show that the con- 

r J ' vergence of a graph sequence under the spectral distance is equivalent to 

the convergence using the (normalized) cut distance. The resulting graph 

limit is called graphlets. We then consider several families of graphlets 

C^ I and, in particular, we characterize quasirandom graphlets with low ranks 

for both dense and sparse graphs. For example, we show that a graph 
sequence Gn, n G Z, converges to a graphlets of rank 2, (i.e., all normal- 
ized eigenvalues Gn converge to except for two eigenvalues converging 
>D , to 1 and p > 0) if and only if the graphlets is the union of 2 quasirandom 

*^ ' graphlets. 
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1 Introduction 



The study of graph hmits originated from quasi-randomness of graphs which 
concerns large equivalent families of graph properties that random graphs sat- 
isfy. Lovasz and Sos [37^ first considered a generalized notion of quasi-randomness 
as the limits of graph sequences. Since then, there has been a great deal of de- 
velopments [nSlli[T0l[2l[2g[2i[27l[3l|33[3i|37l[3i|3S|40l|4TJ|4^ on 
the topic of graph limits. There are two very distinct approaches. The study of 
graph limits for dense graphs is entirely different from that for sparse graphs. 
By dense graphs, we mean graphs on n vertices with cri^ edges for some constant 
c. For a graph sequence of dense graphs, the graph limit is formed by taking 
the limit of the adjacency matrices with entries of each matrix associated with 
squares of equal sizes which partition [0, 1] x [0, 1] (see [SHIIS^). Along this line 
of approach, the graph limit of a sparse graph sequence gives to zero. Conse- 
quently, very different approaches were developed for graph limits of very sparse 
graphs, mostly with vertex degrees bounded above by a constant independent 
of the size of the graph |3l 13 126] . 

To distinguish from previous various definitions for graph limits (called, 
graphons, graphines, etc.), we will call the graph limits by the name of graphlets 
to emphasize the spectral connection. In the subsequent sections, we will give a 
detailed definition for a graphlets as the graph limit of a given graph sequence. 
Although the terminology is sometimes similar to that in differential geometry, 
the definitions are along the line of spectral graph theory [13) and mostly dis- 
crete. In addition, the orthogonal basis of the graphlets of a graph sequence 
can be used, with additional scaling parameters, to provide a universal basis for 
all graphs in the domain (or the union of domains) that we consider. In this 
regard, graphlets can be regarded as playing a similar role as the wavelets do 
for affine spaces. 

To study the convergence of a graph sequence, various different metrics come 
into play for comparing two graphs. For two given graphs, there are many 
different ways to define some notion of distance between them. Usually the 



labeling map assigns consecutive integers to the vertices of a graph which can 
then be associated with equal intervals which partition [0,1]. As opposed to 
the definitions in previous work, we will not use the usual measure or metric 
on the interval [0, 1]. Instead, our measure on [0, 1] will be determined by the 
graph sequences that we consider. Before we proceed to examine the distance 
between two graphs, we remark that there is a great deal of work on distances 
between manifolds [H [31] via isometric embeddings. Although the details are 
obviously different, there are similarities in the efforts for identifying the global 
structures of the objects of interest. We are using elements of [0, 1] as labels for 
the (blow-up) vertices, similar to the exchangeable probabilistic measures that 
were used in [TJ [Ml 123 [2H1 [2i [M] ■ 

Several metrics for defining distances between two graphs originated from 
the quasi-random class of graphs |16[ I19| . One such example involves the sub- 
graph counts, concerning the number of induced (or not necessarily induced) 
subgraphs of G that are isomorphic to a specified graph F. Another such met- 
ric is called the cut metric which came from discrepancy inequalities for graphs. 
The usual discrepancy inequalities in a graph G concern approximating the 
number of edges between two given subsets of vertices by the expected values as 
in a random graph and therefore such discrepancy inequalities can be regarded 
as estimates for the distance of a graph to a random graph. For dense graphs, 
the equivalence of convergence under the subgraph-count metric and the cut 
metric among others are well understood (see [S1[3H])- The methods for dealing 
with dense graph limits have not been effective so far for dealing with sparse 
graphs. A different separate set of metrics has been developed [71 [26] using local 
structures in the neighborhood of each vertex. Instead of subgraph counts, the 
associated metric concerns counting trees and local structures in the "balls" 
around each vertex. The problems of graph limits for sparse graphs are inher- 
ently harder as shown in [7]. Nevertheless, most real world complex networks 
are sparse graphs and the study of graph limits for sparse graphs can be useful 
for understanding the dynamics of large information networks. 

The paper is organized as follows: In Section [2] we first examine the con- 
vergence of degree distributions of graphs and we consider the convergence the 
discrete Laplace operators under the spectral norm. Then we give the definition 
for graphlets in Section 12.41 In Section [3] we give several families of exam- 
ples, including dense graphlets, quasi-random graphlets, bipartite quasi-random 
graphlets and graphlets of bounded rank. In Section [4] we consider the discrep- 
ancy distance between two graphs which can be viewed as a normalization of 
the cut distance. Then we prove the equivalence of the spectral distance and 
the normalized cut distance for both dense and sparse graphs. Note that our 
definition of the discrepancy distance is different from the cut distance as used 
in [7] where a negative result about a similar equivalence was given. In Sections 
[5]and[6l we further examine quasi-random graphlets and bipartite quasi-random 
graphlets for graph sequences with general degree distributions. In Sections [7] 
and [8] we give a number of equivalent properties for certain graphlets of rank 



2 and for general k. In Section ^ we briefly discuss connections between the 
discrete and continuous, further apphcations in finding communities in large 
graphs and possible future work that this paper might lead to. 

We remark that the work here is different from the spectral approach of graph 
limits which focuses on the spectrum of the limit of the adjacency matrices in 
[i5] . If the graph limit is derived from a graph sequence which consists of dense 
and almost regular graphs, the two spectra are essentially the same (differ only 
by a scaling factor). However, a subgraph of a regular graph is not necessarily 
regular. All theorems in this paper hold for general graph sequences for both 
dense graphs and sparse graphs. Some of the methods here can be generalized 
to weighted directed graphs which will not be discussed in this paper. 



2 The spectral norm and spectral distance 

For a weighted graph G = {V, E) with vertex set V and edge set E, we denote 
the adjacency matrix by Aq with rows and columns indexed by vertices in 
V. For an edge {u,v} G E, the edge weight is denoted by Ag{u,v). For a 
vertex v in V{G), the degree of v is dQ{v) — '^^ Ag{u,v). We let Dq denote 
the diagonal matrix with Dg{v,v) — dQ{v). Here we consider graphs without 
isolated vertices. Therefore, we have daiv) > for every v and Dq^ is well 
defined. 

We consider the family of operators W consisting of W : [0, 1] x [0, 1] -^ [0, 1] 
satisfying , W{x, y) — W{y, x). W is said to be of finite type if there is a finite 
partition [Si, ..., S'„) of [0, 1] such that W is constant on each set Si x Sj . Given 
a graph G„ on n vertices, a special finite-type associated with G^ is defined by 
partitioning [0, 1] into n intervals of length \/n and, for a map r] : [0, 1] — > V ^ 
the prc-image of each vertex v corresponds to a interval I(v) = {j /n, {j + 1)/?t.] 
for some j. We can define Wg„ € W by setting : 

WGA^,y) = AgAu,v) (1) 

ifxe /,,(„), and y G /,,(„). 

Suppose we have a sequence of graphs, Gn,n e Z. Our goal is to describe 
the limit of a graph sequence provided it converges. One typical way, as seen in 
[55] . is to take the limit of Wg„- For example, if Gn is in the family of random 
graphs with edge density 1/2, the limit of Wg„ has all entries 1/2. However, 
if we consider sparse graphs such as cycles, then the limit of Wg„ converges to 
identically function. 

Instead, we will define the graph limit J7 to be a measure space as the limit 
of measure spaces ^g^ ^^d the measure /i for Q. is the limit of the measures 
jjLn associated with ^Cn- Ii^ order to deal with the limit, there are a number of 
technical issues in need of clarifications: 



Remark 1. Wc label elements of O by [0, 1]. However, the geometric structure 
of fl can be quite different from the interval [0, 1]. In general, il can be some 
complicated compact space. For example, if the G„ are square grids (as cartesian 
products of two paths), then a natural choice for J7 is a unit square. We will 
write V{i^) = [0, 1] to denote the set of "labels" for fl while 51 can have natural 
descriptions other than [0, 1]. 

Remark 2. In this paper, we mainly concern operators W that are exchangeable 
(see [T1I211I1S1I1H1IM1I31])- Namely, for a Lebesgue measure-preserving bijection 
T : [0, 1] -^ [0, 1], a rearrangement of W, denoted by Wr, acts on functions / 
defined on [0, 1] satisfying 

Wfix) = Wrfirix)). (2) 

We say W is equivalent to Wr and we write W ~ Wr- By an exchangeable 
operator W, we mean the equivalence class of operators Wr where r ranges 
over all measure-preserving bijections on [0, 1]. 

Remark 3. We consider a family of exchangeable self-adjoint operators W* 
which act on the space of functions / : [0, 1] ^- R. Clearly, any exchangeable 
W : [0, 1] X [0, 1] -^ [0, 1] with W{x, y) = W{y, x) is contained in W*. The dis- 
advantage of using such W is the implicit requirement that W{x,y) is supposed 
to be given as a specified value. For r2„ which is derived from a finite graph, it is 
quite straightforward to define the associated Wn as in ([T]). However, Wnix^y) 
as a function of n is quite possible to approach as n goes to infinity. In such 
cases, it is better to treat the limit as an operator. 

Remark 4. Throughout the paper, J F{y)dy denote the usual integration of a 
function F subject to the Lebesgue measure v. We will impose the condition 
that the space of functions that wc focus on are Lebesgue measurable and inte- 
grable so that all the inner products involving integration make sense. Although 
many other measures, such as /i„ and fj, for a graph sequence, are defined in 
Section 12.11 and 12. 2[ it can be easily checked that if a function F is Lebesgue 
measurable and integrable then F is also measurable and integrable subject to 
Hn and /n. 



2.1 The Laplace operator on a graph 

For a weighted graph G„ on n vertices with edge weight An{u, v) for vertices u 
and V , we define the Laplace operator A„ to be 

A„/(h) = ±J2(^f(u)~f{v))AUu,v). (3) 

It is easy to check that 



where >C„ is the symmetric normahzed Laplacian occurring in '13'. 

Let /z„ denote the measure defined by finiv) = dy/yol{G) for v in Gn where 
vol(G„) = J2v ^v- We define an inner product on functions /, g : y — > R by 

It is then straightforward to check that 

{f{u)-f{v)){g{u)-g{v)) E„ /(") E..J.9N - ffW) 



E 






vol(G„) 

(/,A„.g)^„ 



and 

If / and 5 are complex-valued functions, then we have 



where x denotes the complex conjugate of x. 

We note that (/, A„l)^^ = (1,A„/)^^ = 0, where 1 denotes the constant 
function 1. Therefore, A„ has an eigenvalue with an associated eigenfunction 
1, under the /i„-norm. The eigenfunctions 4>j, for j = 0, ... ,n — 1, form an 

1/2 

orthogonal basis under the /i„-norm for G„. In other words, Dn ipj form an or- 
thogonal basis under the usual inner product as eigenvectors for the normalized 
Laplacian / — Dn A^Dn ■ The (/)j's are previously called the combinatorial 
eigenfunctions in |13) . 



2.2 The convergence of degree distributions 

Suppose we have a sequence of graphs. For a graph G„ on n vertices, the 
measure /i„, defined by /i„(w) — ^^Zq \ , is also called the degree distribution 
of Gn where dn{v) denotes the degree of v in G„ and vol(G„) = J2v dn{v)- In 
general, for a subset X of vertices in G„, vo1g„(X) = X]i,ex^"(^)- ^^ *^i^ 
paper, we focus on graph sequences with convergent degree distributions which 
we will describe. 

For a graph G„ with vertex set Vn consisting of n vertices, we let Fn denote 
the set of all bijections from Vn to {1,2,..., n}. 

fn = {?7:K-^{l,2,...,n}}. (4) 



For each ij € Fn, we let rjn denote the associated partition map ?]„ : [0, 1] — > Ki, 
defined by rjn{x) = ri{u) if x £ ((??(«) — ^)/'n,ri{u)/n] = I,i(u)- We sometimes 
write Ijjiu) — lu if there is no confusion. 

Now, for any integrable functions f,g : [0, 1] — > M, we can define 

(/,3)p„,.= / f{x)g{x)f,(^\x) (5) 

Jo 

where 

[ F{x)^l:'\x)^ I F{x)n^i„{r^„{x))dx (6) 

Jo Jo 

for integrable F : [0, 1] — > R. We can then define the associated norm: 

II/IIm„,. = ^/{fJhZ, (7) 

As a measure on [0, 1], /x„ satisfies 

Mn(«) = / mI^H^) 

and 



For example, for a graph G5 with degree sequence (2,2,3,3,4), ^„(w2) = 1/7 
where W2 denotes a vertex having label 2. 

In particular, for a subset S C [0, 1], we consider the characteristic function 
Xs{x) = 1 if x G 5 and otherwise. Then for f ^ g — XSi we have 

(xs,Xs)M„,r, = ^^^n^(S) 

/i"'!^)- (8) 

Sometimes we suppress the labeling map 77 and simply write /i„ as the associated 
measure on [0, 1] if there is no confusion. 

For e > 0, we say two graphs G^ and G„ have e-similar degree distributions 
if 

inf f\^,^^\x)~^,(;^){x)\<e. (9) 

For a graph sequence Gn, n = 1, 2, . . . , we say the degree distribution ^„ is 
Cauchy, if for any e > 0, there exists N = N{e) such that for any m,n > N, 
the degree distributions of Gm and G„ are e-similar. To see that the degree 
distributions converge, we use the following arguments: 



Lemma 1. // the degree distribution of the sequence Gn is Cauchy, then there 
are On G i^„ such that the sequence n-h" of Gn converges to a limit, denoted by 
(jL. Furthermore jjl is unique up to a measure preserving map. 



Proof. For each positive integer j, we set e^ = 2~^, and let N{€j) denote the 
least integer such that for m,n > N{ej), Gm and Gn have ej-similar degree 
distributions. To simplify the notation, we write M{j) — N{ej). 

We first choose an arbitrary permutation ?yM(i) ^-nd then by induction define 
permutations ?7m(j)'s, for j > 1 using ^ so that 

I (^Af(j))/ \ _ (Sm(3 + 1))/ \\ 

For each n £ [M{j), M{j + 1)), we choose the permutation ?;„ such that 



\f^l'"\x)-^,Zzfi-)\<^j- 



Claim: the seqence of ^n" , for n = 1,2,... is Cauchy. 

To prove the claim, we see that for any m,n > M{j) satisfying n G [M{j), M{j-\- 

1)) and TO e [M(fc), M{k + 1)) with j < k, we have 

< I' I i^l^"Hx)-i^Zzfi-) I +/ I i^m\^)-i^nj^\-) I +•■■ 

< 2ej +ej+i + ... + efc-i +2efe 
= 3ej 

and the Claim is proved. 

To show that the sequence fin " converges, we define /i(5) for any measurable 
subset S C [0, 1] as follows: 

fiiS) = lim ^(f")(5) 

= lim f fii'"\x). 

Since /i„ " is Cauchy, the above limit exists and fJ-{S) is well defined. Further- 
more, for any measure preserving map r, /i o r is the limit of /in " ^ . Thus, fj, 
is unique up to a measure preserving map. 



To see that /i is a probabilistic measure, we note that for any e > 0, there is 
some n such that 

1 ,.1 ,.1 







< £• 

Lemma [I] is proved. D 

Remark 5. Since we are deahng with exchangeable operators, the measures 
/z can be regarded as the equivalent class of probabilistic measures where two 
measures t/s, Lp' are said to be equivalent if there is a Lebesgue measure preserving 
bijection t on [0, 1] such that ip = Lp' o t. 

Remark 6. An alternative proof for Lemma [1] is due to Stephen Young [45] 
which is simpler but the resulted limit /z is not necessarily exchangeable. For 
each n, suppose we choose rjn such that /Zn " is a non-decreasing function on 
[0, 1]. By using the fact that for xi < X2 and yi < 2/2, we have \xi — yi\ + \x2 ~ 

y2\ < \xi - y2\ + \x2 - yi|, it follows that 

inf f\^,^l\x)-n^:\x)\^ /V^""H^)-m1''"H^)I 

(ri ) . 

Thus the sequence fin is Cauchy and therefore converges to a limit /i. 

We note that two different graphs G and H both on n vertices can have 
the same degree distribution measure /i„ but G and H have different degree 
sequences. For example, G is a fc-regular graph and iJ is a fc'-regular graph 
where k ^ k' . In this case, fj-niv) = ^/n for any vertex v and /i„(a;) — 1 for any 
x e [0, 1]. To define the convergence of graph sequences, we need to take into 
account the volume vol(G) = J^v '^v of G. 



2.3 The spectral distance 

Suppose we consider two graphs Gm and G„ on m and n vertices. Their associ- 
ated Laplace operators are denoted by A^ and A„, respectively. If to ^ n, A^ 
and A„ have different sizes. In order to compare two given matrices, we need 
some definitions. 

In G„ = (VmEn), for -q ^ F^ (as described in ^), the operator A,i is 
acting on an integrable function / : [0, 1] ^ M by 



Al-'Vlx) = dn{Vn{x)) io ^f^^)-fiv))^n\x,y)dv (10) 



where Wn G W = [0, 1] x [0, 1] is associated with the adjacency matrix An by 
W^^^x^y) — An{rinix),r]n{y))- Here we require / to be Lebesgue raeasurable 
and therefore is also /i„-measurable. In the remainder of the paper, we deal 
with functions that are Lebesgue integrable on [0, 1]. We note that for any two 
permutations 9,r] G Fn, W^^'' is equivalent to W'^'^'^ as exchangeable operators 
in W*, defined in ©. 

For a Lebesque integrable function / : [0, 1] -^ M, we consider 

Jo 

!{x){9{x)-g{y))Wi^){x,y)dyii^:^){x) 



/o Ja dnivix)) 
Using dH]), we have 



{f,Ai^'>9),^., = -TT^J / f{x){g{x)~g{y))wi^\x,y)dxdy 

VOi[Un) Jo Jo 

.1 

(fix) - fiy)){9{x) - g{y))w(,^Hx,y)dxdy. 



2vol(G„; JO JO 

In particular, 

(/, A(:')/)^„,, - 2yf{Gn) lo l! ^^^""^ " /(2/))VW(x,y)dxdy. (11) 

and by taking / ~ xs and g = 1, 

(X5,(/-A(''))l)^„,, = /i('')(5) (12) 

for 5C [0,1]. 

Remark 7. The above inner products are invariant subject to any choice of 
measure preserving maps r. Namely, if we define / o t{x) = /(r(x)), then 

(/, A(:')/)^„,, = (/ o r, A(:'-)/)^„or,,o.. (13) 

For an operator M acting on the space of integrable functions / : [0, 1] — )► R, 
we say Af is exchangeable if for any measure preserving map r, we have 

{f,Mg)^{foT,Mr{goT)) 

where M^ is defined by MTh{x,y) — M{h{T{x),h{T{y))). Clearly, A„ is an 
exchangeable operator. 

For an integrable function / : [0,1] — > R and r/n G F„, we define /„ : 

Vn U [0, 1] ^ R, 

f f , 7 / s Il(u)fi^)t^'^^\^)dx _ Jj^^)fix)tJ.'h'"\x)dx 

Iliu)f'^^"\x)dx Mn(w) 

(15) 
10 



/ and / are related as follows: 

Lemma 2. 

(i) For X e lu, 

Ai^-Vlx) = TT^T.[ ifi^)~fiy))AniVnix),v)dy 

= A„/„(7.) + (fix) - U{u)). 
(ii) For f,g: [0,1]^ R, 

(/,A(:'")/)^„,,„ = (/,A„/\„ + ||/-/||2_^_^„. (16) 

The proof of (i) follows from ([3]) and p4| . (ii) follows from (i) and (fTTj) by 
straightforward manipulation. 

Remark 8. In this paper, we define inner products and norms on the space of 
integrable functions defined on [0, 1], as seen in ^ and ([7]). Consequently, the 
last term in (|16p approaches as n goes to infinity. Namely, 

ll/~/llL,'?n -^0 asn->oo 

if / is integrable. This implies that the graph Laplacian A„ for G„ acting on 
the space of functions defined on V„ can be approximated by A„ " acting on 
the space of functions defined on [0, 1] with the exception for the function / 
with ||A„/||^^_^^ is too close to 0, while / is orthogonal to the eigenfunction 
associated with eigenvalue 0. The case of a path P„ is one such example and 
in fact, the graph sequence of paths P„ does not converge under the spectral 
distance that we shall define. In order to make sure that A„ "' closely approxi- 
mates A„, there are two ways to proceed. We can restrict (implicitly) ourselves 
to graph sequences G„ with the least nontrivial eigenvalue Ai of A„ greater 
than some absolute positive constant (as done in this paper). An alternative 
way is to consider general labeling space fig other than [0, 1] and impose further 
conditions on the space of functions defined on Qq (which will be done in a 
subsequent paper). 

For a graph sequence G„ = {Vn, En), for n = 1,2,..., we say the sequence 
of the Laplace operators A„ is Cauchy if for any e > there exists N such that 
for m,n> N, there exist 9m G Fm, On S Fn such that the following holds: 
(i) The associated measures Hm and Hm satisfy 



11 



(ii) The Laplace operators associated with Gm and Gn satisfy 



for integrable /, g defined on [0, 1] and we write 



< e 



(i(A„, A„)^„ 



< e 



(17) 



where firm Mn denote the degree distributions of Gm, Gn, respectively. 



Remark 9. We note that the spectral distance here is invariant subject to any 
choices of measure preserving maps. In fact, for any measure preserving map 
T, it follows from the definition and that d(Am, A„)p^^^^ < e if and only if 

a\^ri%, ^n) llmOT-JlnOT "^ ^■ 

Suppose the sequence of graphs G„ = {Vn,En) has degree distributions /i„ 
converging to /x as above. Then (J17l) can be simplified. The inequality in (J17l) 
can be replaced by an equivalent condition 

d(A™, A„)p < e 

which can be described by there exists N such that for m,n > N , there exist 
9m G Fm , On & Fn such that the Laplace operators associated with G,,,. and G„ 
satisfy 



(/,(Ai^)-A^))g), 



< e 



(18) 



for integrable f,g : [0, 1] ^ M with ||/||^ = ||g||^ = 1 . 

For an operator M on [0, 1] we can define spectral /x-norm, defined by 

\\M\\l ^ sup \{f,Mg)^\ 

where f,g : [0, 1] — > R range over integrable functions satisfy ||/||p = \\g\\fi = 1- 

Lemma 3. For a graph sequence Gn ~ {Vn, En), for n = 1,2,..., suppose the 
sequence of the Laplace operators A„ is Cauchy, then for each n, there are per- 
mutations On & Fn such that the sequence of An " converges to an exchangeable 

/a \ 

operator A and the measure pin " of Gn 's converge to /i where A satisfies 



f{x)Ag{x)fi{x) ^ lim(/,A^)g)^„ (19) 



for any two integrable functions f,g : [0, 1] 
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Proof. For each positive integer j, we set ej = 2~^ , and let N{€j) denote the 
least integer such that for m,n > N{ej), (|17p holds for ej. To simplify the 
notation, we write M{j) — N{ej). 

We first choose an arbitrary permutation ryn) G ^Mij) s-nd then by induction 
define permutations 0(j) e Fm(j)'s, for j > 1, using ([T7)) so that 

C?(AA.f(j),AA/(j + l)) (flu)) (eo+i)) <ej- 

We can assume the associated measure for ^q) is non-decreasing since we can 
simply adjust by choosing measure preserving maps. 

For each n E [M{j), M{j + 1)), we choose the permutation 6'„ such that 



(i(A„, AifQ) 



„(«„) ,("0)) < '^3- 



We will use a similar method as in Lemma [T] to prove the following: 

Claim 1: The sequence of A„ " , for rt = 1, 2, . . . is Cauchy. 

To prove the claim, we see that for any m,n > M(j) satisfying n G [M{j), M{j~ 
1)) and TO € [M(k), M{k + 1)) with j < k, we have 

d{Am, A„) (e„) (0™) 

< d(A„,AM(j)) (,„) (e(^)) +... 

+d(Ajv/(fe-i), AM(fe)) (e(fe_i)) (e(fe)) + C?(AM(fe), A,„) (e(fe)) (s^. 



< 2e, 
= 3eo 



fj+i 



+ efe_i +2ek 



and Claim 1 is proved. 



,(«■ 



Claim 2: The sequence of /i„ ' is Cauchy and therefore converges to a limit /i. 
To prove Claim 2, we will first show that for any e > 0, m, n > N{£j), and any 

subset S C [0, 1], we have \nln"'\s) - fJ.'n"\s)\ < 6ej. 

From the proof of Claim 1, we know that d(Am"' , A„ "•^) < 3ej, which 
imphes, by choosing f — xs and g = 1 in pG)) and p2)) . 



3., > d(A^),A^)) 



> 



> 



^/^i^^) - ,/^F\s) 



tiZ-\s)~^,i:-\s) 



\S) 



^ii'"\s) 



> - 

- 2 



^(,^)(5)-M^^^) 
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(Q \ ( f) \ (ft \ 

To show that fin ^ is Cauchy, we set S — {x : fin "" (x) > fiin^ (x)}. Then, 



s 



= 2|M^H5)~M,^H5) 



< 12e,- 
Claim 2 is proved. 

Now, we can define the operator A: 

(/,Ag) = / f{x)AgixMx) (20) 

Jo 

= lim(/,Al«")5)^„ (21) 

for any two integrable functions /, g : [0, 1] — J> R. 

Combining Claims 1 and 2, the sequence A„ "-^ converges to a limit A. D 

For a graph sequence Gn,n = 1,2,..., the Laplace operator A^ of G„ and 
Wg„ S W* are related as follows: For functions f,g : [0, 1] — >■ R, by using ([SJ 
we have 

(/, (/„ - A(''))g)^„,, = /' /(a:)((/„ - Ai^^)g{x))^^i^Hx) 

Jo 

f{x)g{y)Wi^\x,y)dyii'^^\x) 



Jo dn{r]{x))- 

f(x)g{y)Wl:P\x,v)dxdy 



VO^Gn) Jo Jo 

although the existence of the limit of Wg^ is not necessarily required. 

There are similarities between A and the previous definitions for graph limits 
(as defined in [38]) but the scaling is different as seen below: 

f{x){{I-A)g{x))^l{x) = hm / f{x){{I-A[:'^)g){x)^l^:\x) 

9 

= lini(/,— — -W^„5). (22) 

n^oo VOl(G„) 

Suppose the graph sequence have volume vol(Gn) converging to a function $. 
Then we have 

71 

f{x){{I-A)g{x))^,{x) = hm (/, — -— W^G„g) (23) 

«-i-oo vol(G„) 

Thus, the Laplace operator A as a limit of A„ is essentially the identity operator 
minus a scaled multiple of the limit W. We state here the following useful fact 
which follows from Lemma [S] 
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Lemma 4. Suppose a sequence of graphs Gn, n e Z„ has degree distribution /i„ 
converging to fi. Then the associated Laplace operators A„,n £ Z, converge to 
A satisfying 

{Xs, {I - A)1>M = /4^) > 0, (24) 

and (xs,(/-A)xt)^>0 (25) 

for any integrable subsets S', T C [0, 1] where 1 is the constant function assuming 
the value 1. 

Proof. The proof of ((24|) follows from the fact that 

(Xs,(/-A)l)^ = lim(xlr\(/-A«)l)/.„ 

= lim (xs"\l>M>. 
- lim M„(5(")) 

n— ^oo 

= m('S). 

To see (j25p . we note that for any two vertices u,v in G„, {xu, [I ~ A„)xi;)^ = 
^„(m,z;)/vo1(G„)>0. D 



2.4 Defining the graphlets 

Using the convergence definitions in the previous subsections, we are finally 
ready to define graphlets. We say a graph sequence converge to a graphlets 
VL = r2(/x, A), denoted by 

Gi,G2,...,G„,---^f7 (26) 

if the following conditions hold: 

1. The degree distributions of G„ introduce measures /j,„ on fi and /i„'s 
converge to a measure pt for fJ as in ([9]). 

2. The discrete Laplace operators A„ for G„ converge to A as an operator 
on f2 under the spectral distance using the /i-norm as in ([3]). 

3. The volume vol(G„) of Gn is increasing in n. 

Another way to describe a graphlets i7 is to view 57 as the limit of 57„. Here 
the graphlets ri„ can be described as a measure space under a measure /i„ as 
follows. The elements in n„, (the same as that of fi, labelled by [0, 1]) is the 
union of n parts, denoted by l{v\ indexed by vertices v of G„. Furthermore, 
the Laplace operator A„ can be defined by using the adjacency entry A„(u, v) = 
W„(x,2/) for X e I(u) and y £ I{y). Namely, A,i{x,y) = (/„ - Wn){x,y). The 
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graphlets fi = (/i, A) as the limit of J7„(/i„, A„) specifies the incidence quantitiy 
between any two integrable subsets 5* and T in f2. For an integrable 5' C J7, we 
let xs denote the characteristic function of S, which assume the value 1 on S', 
and otherwise. In fl„, the incidence quantity between S and T, denoted by 
£n{S,T) satisfies: 

£n{S,T) = vol(G„) / xs{x){{I-A)xT{x))fi{x). (27) 

Jo 

In particular, for S = T, 

En{S,S) « vo\{Gn){fi{S) ~ fi{d{S))) 
where the boundary d{S) of S satisfies 

Er.{S,S)^vo\{Gn)Kd{S))^vo\{Gn) I xs{x)^xs{x)^i{x). 

Jo 

and the degree of v satisfies 

dn{v) w vol(G„) / fi{x). 

Jl(v) 

Examples of graphlets will be illustrated in Section |31 



3 Examples of graphlets 

We here consider several examples of graphlets Q, — ft{fJ,, A) which are formed 
from graph sequences Gn,n G Z. We will illustrate that the eigenfunctions of 
A can be used to serve as a universal basis for graphs Gn- The discretized 
adaptation of graphlets will be called 'lifted graphlets" for Gn, which are good 
approximations for the actual eigenfunctions in G„ as n approaches infinity. In 
some cases, the lifted graphlets using A are fewer than the number of eigenfunc- 
tions in Gn and in other cases, there are more eigenfunctions of A than those 
of G„. We will describe a universal basis for G„, as the union of two parts, 
including the primary series (which are the lifted graphlets) and complementary 
series (which are orthogonal to the primary series). In a way, we will see that 
the primary series captures the main structures of the graphs while the comple- 
mentary series reflect the "noise" toward the convergence. Before we proceed, 
some clarifications are in order. 

• The notion of orthogonality refers to the usual inner product unless we 
specify other modified inner products such as the /i-product {■, ■)p, or the 
/i„-product. Sometimes, it is more elegant to use eigenfunctions that are 
orthogonal under the /i-norm. However, when we are dealing with a finite 
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graph Gn in a graph sequence, we sometimes wish to use only what we 
know about the finite graph Gn and perhaps the existence of the hmit 
without the knowledge of the behavior of the limit (such as /i) . In such 
cases, we will use the usual inner product. 

• The universal bases are for approximating the cigenfunctions of the nor- 
malized Laplacian of Gn- In a graph G„, its Laplace operator A„ = 
/ — D^^An is not symmetric in general since the left and right cigen- 
functions are not necessarily the same. The universal basis is used for 
approximating the cigenfunctions of the normalized Lapalcian 



i^n — ^ ^-^n ^^n^-^n 



/2 



which is equivalent to A„ and is symmetric. Thus, C has orthogonal 
cigenfunctions. 



3.1 Dense graphlets 

Suppose we have a sequence of dense graphs Gn,n G Z, with vol(G„) = 
2|£'(G„)| = CniT^ where the c„ converge to a constant c > 0. In this case, 
the /x-norm is equivalent to other norms such as the cut-norm and subgraph- 
norm in J9]. By using the regularity lemma, the graphlets f2 of a dense graph 
sequence is of a finite type. In other words, there is a graph H on h vertices 
where /i is a contant (independent of n) such that Q = fin- Let ipi, . . . ,iph 
denote the cigenfunctions of H. 

For n — hm and m G Z, we will describe a basis for a graph Gn- The 
primary cigenfunctions can be written as 

4>j{v) — ifj{\v/Tn~\) where w G {1, 2, . . . , n} and j = 1, . . . , ft., 

while the complementary cigenfunctions consist oi n — h = {ni — l)h cigenfunc- 
tions as follows: For l<a<h,l<b<m— I, 

(„) , , (^2rr^bb'/m ^ a' + I = U, 

raiyam + b) = \ . 

otherwise. 



3.2 Quasi-random graphlets 

Originally, quasi-randomness is an equivalent class of graph properties that are 
shared by random graphs (see [El). In the language of graph limits, quasi- 
random graph properties with edge density 1/2 can be described as a graph 
sequence Gn,n G Z, converging to graphlets 51 where fl = [0,1] satisfying 
W{x,y) = 1/2 and iJ,{x) = fi{y) for all x,y and vol(G'„) = (a)/^- Compared 
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with the original equivalent quasi-random properties for Gn (included in paren- 
theses), the quasi-random graphlets with edge density 1/2 satisfies the following 
equivalent statements. 

(1) GnjiT- G Z, converge to graphlets il in the spectral distance. 

(The eigenvalue property: The adjacency matrix of Gn on n vertices 
has one eigenvalue n/2 + o{n) with all other eigenvalues o{n). ) 

(2) GmTL G Z, converge to graphlets il in the cut-distance. 

(The discrepancy property: For any two subsets S and T of the vertex 
set of Gn, there are \S\ ■ |T|/2 + o(n^) ordered pairs {u, v) with u £ S,v €T 
and {u^v} being an edge of Gn- ) 

(3) Gn,n G Z, converge to graphlets fi in the C4-count-distance. 

(The co-degree property: For all but o{n^) pairs of vertices u and v 
in Gn, u and v have n/A -\- o(n) common neighbors.) 
(The trace property.' The trace of the adjacency matrix to the Ath power 
isnVl6 + o(7i4).) 

(4) Gn,n G Z, converge to graphlets il in the subgraph-count-distance. 

(The subgraph-property: For fixed k > 4 and for any H on k vertices 
and I edges, the number of occurrence of H as subgraphs in Gn is n''/2^ + 
o{n^).) 

(5) GmTi G Z, converge to graphlets in the homomorphism-distance. 

(The induced-subgraph-property: For fixed k > A and for any H on 

k vertices, the number of occurrence of H as induced subgraphs in Gn is 
n^/2{^) -t-o(n'=). ) 

For a quasi-random graph sequence converging to J7, the primary graphlets 
for Gn consists of the all I's vector 1 and the complementary ones are irrelevant 
in the sense that they can be any arbitrarily chosen orthogonal functions. The 
generalization of quasi-randomness to sparse graphs and to graphs with general 
degree distributions [17J[TH] can also be described in the framework of graphlets. 
In the previous work on quasi-random graphs with given degree distributions, 
the results are not as strong since additional conditions are required in order 
to overcome various difficulties [T71IIH]- By using graph limits, such obstacles 
and additional conditions can be removed. In Section [SJ we will give a com- 
plete characterization for quasi-random graphlets with any given general degree 
distribution which include the sparse cases. 



3.3 Bipartite quasi-random graphlets 

A bipartite quasi-random graphlets $7 is basically a weighted complete bipartite 
graphlets. fi can be partitioned into two parts A and B while W{x,y) is equal 
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to some constant p ii {x £ A,y £ B) or {x £ B,y € A), and otherwise. There 
are two nontrivial eigenvalues of / — A, namely, 1 and —1. The eigenfuction (po 
associated with eigenvalue 1 assumes the value (j)o{x) = l/y^fj,{A) for x e A and 
'Po{y) = l/-\//i(-B) for y E B. The eigenfunction 0i associated with eigenvalue 
— 1 is defined by (t>i{x) = l/^/JI{A) ior x e A and (/'i(y) ~ —l/y/fi{B) for 
y£B. 

The bipartite version of quasi-random graphs is useful in the proof of the reg- 
ularity lemma [H]. Bipartite quasi-random graphlets, as well as quasi-random 
graphlets, serve as the basic building blocks for general types of graphlets. More 
on this will be given in Sections [7] and [S] 



3.4 Graphlets of bounded rank 

A quasi-random sequence is a graph sequence which converges to a graphlets 
of rank 1 . We will further consider the generalization of graph sequences which 
converge to a graphlets of rank k. This will be further examined in Sections [7] 
andm 



4 The spectral distance and the discrepancy dis- 
tance 

4.1 The cut distance and the discrepancy distance 

In previous studies of graph limits, a so-called cut metric that is often used for 
which the distance of two graphs G and H which share the same set of vertices 
V is measured by the following (see [91130]). 

cut(G,ff)--^ sup \Eg{S,T)-Eg'{S,T)\ (28) 

|v r S,TCV 

where Eq{S, T) denotes the number of ordered pairs (u, v) where u is in S, v is 
in T and {u, v} is an edge in G. 

We will define a discrepancy distance which is similar to but different from 
the above cut distance. For two graphs G and H on the same vertex set V, the 
discrepancy distance, denoted by disc(G, H) is defined as follows: 



disc(G, H) — sup 

S,TCV 



Eg{S,T) Eh{S,T) 



^yo\g{S)yo\g{T) ^vo\h{S)yo\h{T) 



(29) 



We remark that the only difference between the the cut distance and the dis- 
crepancy distance is in the normalizing factor which will be useful in the proof 
later. 
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For two graphs Gm and G„ with m and n vertices respectively, we use the 
labehng maps 9 and 77 to map the vertices of Gm and G„ to [0, 1], respectively. 
We define the measures ^m and Hn on [0, 1] using the degree sequences of Gm 
and Gn repectively, as in Section [521 From the definitions and substitutions, 
we can write: 

Eg^ [S, T) = vol(G„)(xs, (/ - A„)XT)M„.e. (30) 

Therefore the discrepancy distance in ([2^ can be written in the following general 
format: 

disc (G„,G„) 

(xs, {I - A„)xT)A<™,e (xs, [I - A„)xt),x„,,, 



inf sup 

9&F^:V^F„ S,T<Z[0,1] 



y'JhJ^S)iI^jT) ^/JlJSJJIJT) 



(31) 



where S,T range over all integrable subsets of [0, 1]. We can rewrite ([50]) as 
follows. 

EgAS,T) - vol(G„) / xsix){{I ^ An)xT)ix)finix). (32) 

Alternatively, Eg^{S,T) was previously expressed (see [38]) as follows: 

Eg^{S,T) = n^ f f W{x,y)dsdt (33) 

The two formulations (15^ and p3l) look quite diff^erent but are of the same 
form when the graphs involved are regular. However, the format in (|33)) seems 
hard to extend to general graph sequences with smaller edge density. 



Although the above definition in ([3T|) seems complicated, it can be simplified 
when the degree sequences converge. Then, /!„ and /i„ are to be approximated 
by the measure ^ of the graph limit. In such cases, we define 



disc^(Gm,G„) = inf sup 



(xs,A„xt)a<,0 (xs,A„XT)/i,,7 



.MSMT) y/KSMT) 



" ^'^P — F====I(XS,(A™-A„)XT)MI• 
S,TC[0,1] ^Jti{S)^.(T) 

where 5, T range over all integrable subsets of [0, 1] and we suppress the labelings 
0, rj which achieve the infininum. 

We will show that the convergence using the spectral distance defined under 
the /i-norm is equivalent to the convergence using the discrepancy distance in 
Section m 
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4.2 The equivalence of convergence using spectral distance 
and the discrepancy distance 

We will prove the following theorem which holds without any density restriction 
on the graph sequence. The proof extends similar techniques in Bilu and Linial 
[5] and [6l [11] for regular or random-like graphs to graph sequences of general 
degree distributions. 

Theorem 1. Suppose the degree distributions /i„, of a graph sequence Gn,n £ Z 
converges to fj,. The following statements are equivalent: 

(1) Gn,n G Z, converges under the spectral distance. 

(2) Gn,n £ Z, converges under the disc-distance. 

Proof. Suppose that for a given e > 0, there exists an A^ > 1/e such that for 
n > TV, we have 

||m« -mIIi < £• 

The proof for (1) => (2) is rather straightforward and can be shown as follows: 

Suppose (1) holds and we have, for m,n > N, ||/im — /x||i < e, ||/i„ — /i||i < e 
and II Am — A„||^ < e. (Here we omit the labeling maps 9m, 0n to simpHfy the 
notation.) Then, 



disc(G,„,G„) = sup 

S,TC[0,1 



(xs,(^- A„)xt)m,„ (xs,(/- A„)xt)a 



\/JhJMihJXJ ^^„(5)^„(T) 

1 

S,Tcfo,l] y/fl{S)n{T) 



< sup ^====|(xs,(A™-A„)xT)Ml+2e 



s,Tc[Q,i] IIxsIUIIxtIU 

< ||A,„-A„||^ + 2e 
<3e. 

To prove (2) => (1), we assume that for M ~ A„ — A„j 



\{xs,MxT)^.\<eVl4S)t4T) (34) 

for some e > for any two intcgrable subsets S,T C [0, 1]. It is enough to show 
that for any two integrable functions /, g : [0, 1] -^ R, we have 

|(/,Af5)Ml<20elog(l/e)||/|UI|g|U (35) 

provided e < .02. 
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The proof of (|35p follows a sequence of claims. 

Claim 1: For an integrable function / defined on [0, 1] with ||/||^ = 1, for any 

e > 0, there exists an N{e) such that for any n > N{e) there is a function h 

defined on [0, 1] satisfying : 

{l)\\h\\^<l, 

(2)|l/-/i|U<l/4 + e, 

(3) The value h{y) in the interval ((j ~ l)/n^i /n] is a constant hj and hj is of 

the form (|)-' for integers j. 

Proof of Fact 1: Since / is integrable, for a given e, we can approximate ||/||^ 

by a function /, with f{x) — fj in ((j — l)/'mn,j/mn], such that 



\f-fy{x)^,ix) 



< e. 



For / = {fj)i<j<mn, we define h = {hj)i<j<mn as follows. If fj = 0, we set 
hj — 0. Suppose fj ^ 0, there is a unique integer fc so that (4/5)'^ < \fj\ < 
(4/5)'^"^. We set hj — sign(/)(|)'^ where sign(/j) = 1 if fj is positive and —1 
otherwise. Then 

which implies \\f-h\\l < e + Ejo \f,-h,\'fi{x) < 6 + j^Et l/jf A^W = Te+^- 
Claim 1 is proved. 

Claim 2: Suppose there are functions f',g' satifying ||A/||p = |(/', A/g')p| and 
ll/'IU = llff'IU = 1- K /'.9 are functions such that ||/||^, ||.g||,, < 1 and \\f' - 
/lU < 1/4 + e,||.9' - .g||,. < 1/4 + e, then 

||M|U<(2 + 4e)|(/,Af5)^|. (36) 



Claim 2 can be proved by using Claim 1 as follows: 

||M|U - |(/',Af.9%| 

< \{f,Mg)^\ + \{f- f,Mg)^\ + \{f' , M{g' - g))^\ 

< |(/,Afg)^| + (^ + 2e)||M||^. 

This implies |1A/||^ < (2 + 4e)|(/, Af^)^], as desired. 

From Claims 1 and 2, we can upper bound ||Af ||^ to within a multiplicative 
factor of 2+4e by bounding of | (/, Mg)f^\ with /, g of the following form: Namely, 
/ = X^((l)*/^*^ where the Z^*-' denotes the indicator function of {x : f{x) = 
(|)'}. Similarly we write g = ^^((f)*^^*^ where the g^*^ denotes the indicator 
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function of {y : gijj) = (f)*}- Now we choose k — log4/5 ^ ^^^ we consider 



\{f.Mg),\<Y,hY^' {f^'\Mg('^} 



< E itr'\if^'\Mg('^), 



s ^ t 



^X + Y + Z. 

We now bound the three terms separately. For a function /, we denote fJ.{f) = 
/i(supp(/)) to be the measure of the support of /. Using the assumption ([M| 
for (0, l)-vectors and the fact that /^^^'s are orthogonal (as well as the (7'*'''s), 
we have 



^= E i^r^'\if^'^^Mg^'^) 

\s — t\<.K 



^' E hY^yf^if^'^Mg^'^) 



\s-t\<K 

< ^^^(E(J)"m(/^^^)+E(^)"M5(*^)) 

s t 

<e(2K+l), 
since each term can appear at most 2k + 1 times. For the second term we have, 
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by using Lemmas SI the following: 



s ^ t ^ ' 



t 



S 

s 
s 
s 

The third term can be bounded in a similar way. Together, we have 
p/||„<(2 + 4f)(,(2K + l) + 4(i)-) 

^<^-^"('(^^ + " + ^«) 

< 20elog(l/e) 
j^ g,^ ~ 17.93 and e < .02. This completes the proof of the theorem. D 



5 Quasi-random graphlets with general degree 
distributions — graphlets of rank 1 

We consider a graph sequence that consists of quasi-random graphs with de- 
gree distributions converging to some general degree distribution. We will give 
characterizations for a quasi-random graph sequence by stating a number of 
equivalent properties. Although the proof is mainly by summarizing previous 
known facts, the format of graph limits helps in simplifying the previous various 
statements for quasi-random graphs with general degree distributions including 
the cases for sparse graphs. 

Theorem 2. The following statements for a graph sequence G„, n — 1,2,..., 
are equivalent: 

{i) Gn,n G Z, form a quasi-random sequence with degree distribution converg- 
ing to fi. 
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(a) Gn, n £2,, converge to a measure space O with measure ji and VL is of rank 
1. Namely, I — A has one nontrivial eigenvalue 1. (Equivalently, for each 
n, I — A„ has all eigenvalue o(l) with the exception of one eigenvalue 1.) 

{Hi) G„,n G Z, converge to a measure space fl with measure jjl and the Laplace 
operator A on fJ satisfies 

f{x){{I~A)g){x)^i{x)^ f f{x)pi{x) [ g{x)fi{x) 

for any integrable f,g:fl^W. 
(iv) The degree distribution fin of Gn converges to jj, and 

where An and Dn denote the adjacency matrix and diagonal degree matrix 
of Gn, respectively. Here \\ ■ \\ denotes the usual spectral norm (in L2) and 
J denotes the all 1 's matrix. 

(v) There exists a sequence e„ which approaches as n goes to infinity such 
that Gn satisfies the property P{£n), namely, that the degree distribution 
fin converges to fj, and for all S,T C_Vn 



P{^n) ■■ 



EiS,T) ^°1(^^^°1(^) 



vol(G, 
where E{S,T) = Eses,teT^(«'0- 



<e„v/vol(5)vol(T) (37) 



Remark 10. Before proceeding to prove Theoreni[5J we note that a sequence 
of random graphs with degree distribution /i„ converging to fi is an example 
satisfying the above properties. Here we use random graph model Gd for a 
given degree sequence d = (d„)i,GG defined by choosing {u,v} as an edge with 
probability dudy/J2s ^^ f*-"" ^^y two vertices u and v, (see [20]). 

Remark 11. The above list of equivalent properties does not include the mea- 
surement of counting subgraphs. Indeed, the problem of enumerating subgraphs 
in a sparse graph can be inherently diflficult because, for example, a random 
graph G{n,p) with p = o{n~^''^) contains very few four cycles. Consequently, 
the error bounds could be proportionally quite large. 

Instead of counting C4, we can consider an even cycle G2k or the trace of 
(2fc)th power, leading to the following condition: 

(vi) For some constant k (depending only on the degree sequence), a graph 
sequence Gn satisfies 

\Trace{I - Anf - l\ = o(l). 
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Remark 12. Suppose that in a graph Gn, all eigenvalues of / — A„ except for 
eigenvalue 1 are strictly smaller than 1. Then as k goes to infinity, the trace of 
the fcth power of / — A approaches 1. How should {vi) be modified in a way 
that it can be an equivalent property to (i) through (v) ? We will leave this as 
an intriguing question. 

Question 1. Is (vi) equivalent to (i) through (v) for some constant k depending 
only on ri? 

Remark 13. It is easily checked that [vi) implies {ii). For the case of dense 
graphs, the reverse direction holds JJj. For general graphs, to prove {ii) — >■ 
[vi) involves the spectral distribution. For example, for a regular graph on n 
vertices and degree d, a necessary condition for [vi] to hold is that nd^/'^ < en- 
In particular, if the spectrum of the graph satisfies the semi-circle law, then 
this necessarily condition is also sufficient. For a general graph, the necessary 
condition should be replaced by rw?/^ < e„ where d is the second order average 
degree, namely, d = J^v '^v/ Sd ^-v Nevertheless, there are quasi-random graphs 
that satisfy (ii) but require k much larger than 2 log n/ log d. For example, we 
can take the product of a quasi-random graph Gp and a complete graph Kq 
which is formed by replacing each vertex of Gp with a copy of Kq and replacing 
each edge in Gp by a complete bipartite graph Kq^. 

Question 2. A subgraph F is said to be forcing if when the number of occurrence 
of F in a graph G is close to what is expected in a random graph with the same 
degree sequence, then all subgraphs with a hounded number of vertices occur in 
G close to the expected values in a random graph with the same degree sequence. 
A natural problem is to determine subgraphs which are forcing for quasi-random 
graphs with general degree sequences. 

Proof of Theorem\Bi We will show [i) =► (w) =► [iv) => {Hi) => {ii) => {i). 

We note that (i) => {v) follows from the implications of quasi-randomness 
for graphs with general degree distributions 18 . Also, (w) => {iv) follows from 
the fact that {iv) is one of the equivalent quasi-random properties. 

To see {iv) <=> {Hi), we note that the Laplace operator A„ of G„ satisfies. 
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forany/,g:F(G„)^K, 

/(a;)(/- A„)5(x)^„(x) - / /(x)^„(x) / g(x)/^„(x) 



|(/,(/-A„)g)^„-(/,l)^„(.9,l)^J 






i;ey„ 



/X-^/MA.-^^)i?-^/V 



vol(G„) 



where /' — D,/ //vol(G'„) and g' ~ DJ g/vol(G„). To prove (Hi) ^ (iv), we 
have from (in), 



f{x){I-A)g{x)nn{x)- / f{x)nn{x) / g(x)^„(a;) 



< e«ll/'llll5'll 



P{x)^in{x) / g^{x)^inix) 



Since /x„ converges to fi and e„ goes to as n approaches infinity, (iv) => (wi) 
is proved. The other direction can be proved in a similar way. 

(iii) => (ii) follows from the fact that / — A is of rank 1. All adjacency 
matrices A„ are close to a rank 1 matrix and therefore fl is of rank 1. 

To prove that (ii) =^ (i), we use the fact that for any graph the Laplace 
operator is a sum of projections of eigenspaces. If fl is of rank 1, there is only 
one main eigenspace of dimension 1 (associated with the Perron vector) for the 
normalized adjacency matrix. D 



6 Bipartite quasi-random graphlets with gen- 
eral degree distributions 

We consider the graph limit of a graph sequence consisting of bipartite quasi- 
random graphs with degree distributions converging to some general degree 
distribution. The characterizations for a bipartite quasi-random graph sequence 



27 



are similar but different from those of quasi-random graphs. Because of the role 
that bipartite quasi-random graphlets plays in general graphlets, we will state a 
number of equivalent properties. The proof is quite similar to that for Theorem 
[5] and will be omitted. 

Theorem 3. The following statements for a graph sequence Gn,n G Z are 
equivalent: 

(i) Gn,n £ Z, form a bipartite quasi-random sequence with degree distribution 
converging to fi. 

(ii) Gn,n £ Z, converge to a measure space fl with measure ii and / — A has 
two nontrivial eigenvalues 1 and —1. Namely, for each n, I — A„ has all 
eigenvalues o(l) with exceptions oftwo eigenvalues 1 and —1. 

(iii) Gn,n £ Z, converge to a measure space fl with measure fi. For some 
X d fl, the Laplace operator A satisfies 

f{x){I - A)g{x)n{x) 



f{x)fj.{x) / g{x)^i{x) + / fix)ti{x) / g{x)fJ.{x) 
xex Jxex Jxex Jxex 

for any f,g:fl^M. where X denotes the complement of X . 

(iv) The degree distribution fXnOf Gn converges to fi and 

" " ^ " vol(G„) ^ " "" ^^ 

where Jx xix, y) — 1 if (x E X and y G X ) and otherwise. 

(v) There exist X G ^ and a sequence e„ which approaches as n goes to 
infinity such that the bipartite graphs Gn satisfies the property that the 
degree distribution fin converges to /i and for all S,T QVn 

{yo\{S n X)yo\{T nX) + yol{S n X)yol(T n X)) 
E{S, T) 



vol(G„) 



< e„Vvol(5)vol(r) 
where E{S,T) = EseS.teT^(«'0- 

7 Graphlets with rank 2 

It is quite natural to generalize rank 1 graphlets to graphlets of higher ranks. 
The case of rank 2 graphlets is particularly of interest, for example, in the sense 
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for identifying two 'communities' in one massive graph. For two graphs with 
the same vertex set, the union of two graphs Gi = {V, Ei) and G2 = {V, E2) has 
the edge set E = Ei U E2 and with edge weight w{u,v) — wi(u,v) + W2{u,v) 
if Wi denotes the edge weights in Gi . We will prove the following theorem for 
graphlets of rank 2. 

Theorem 4. The following statements are equivalent for a graph sequence G„ , S 
Z, where all Gj 's are connected: 

(i) Gn, n G Z, converge to a graphlets fl and /—A has two nontrivial eigenvalues 
1 and p G (0, 1). Namely, or each n, I ~ A„ has all eigenvalues o(l) with the 
exception of two eigenvalue 1 and p. 

(ii) Gn,n G Z, converge to a measure space 51 and il = Oi U fi2 where fij are 
intersecting quasi-random graphlets (of rank \). 

(Hi) Gn,n G Z, converge to a measure space f2 with measure p where p = 
api + (1 — a)p2 for some a G [0, 1] and the Laplace operator A on fl satisfies 

f{x){I ~ A)g{x)p{x) 



= a f{x)pi{x) I g{x)pi{x) + {l-a) I f{x)p2{x) I g{x)p2{x) 
Jn Jn Jn Jn 

for any /, 5 : 51 — S> M. 

(iv) The degree sequence {dy)v^v of Gn can he decomposed as dy = d'^ +(i" with 
d'y > and d" > 0. The adjacency matrix An of Gn satisfies: 

"^" ^^" vo1(G;J vo1(G;0^ " ii-«uj- 

(v) There exists a sequence en which approaches as n goes to infinity such 
that the degree sequence {dy)y^v of Gn can be decomposed as dy = dy + dy with 
d'y >0 and d" > 0. Furthermore, for all S,T CVn 

En{S,T)~ ^ol'('^)^ol'(^) vol"(^)vol"(T) 



vo1(G;,) vo\iG-) 



< e„Vvol(5)vol(r). 



Before we proceed to prove Theorem HI we first prove several key facts that 
will be used in the proof. 

Lemma 5. Suppose that integers dy, d'y and dy, for v in V satisfy dy — d'y + d'y 
and d'y, d'y > 0. Let D, D' and D" denote the diagonal matrices with diagonal 
entries dy, d'y and d'y, respectively. Then the matrix X defined by 

X - D-V2 f P'JD' , D"JD" \ r.-ll2 
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has two nonzero eigenvalues 1 and rj satisfying 

«'\ / vol(G) 



^ = 1- E 



dv J Vvol(G")vol(G"), 
The eigenvector ^ which is associated with eigenvalue rj can be written as 

^ Vvol(G') vol(G"); 

Proof. The lemma will follow from, the following two claims. 

Claim 1: (j)o — D^^'^l/ y^vo\{G) is an eigenvector of X and M. 

Proof of Claim 1: Following the definition that M = D~^/'^AD~^/'^, 0o is an 

eigenvector of M . We can directly verify that 0o is also an eigenvector of X as 

follows: 

X^^^D-^'^^^''^' ^"'^" 



vol(G') vol(G"); V^^dM 



= D-^l^ {D' + D") ^^ 



,fm[{G) 



1/9 Dl 



VvoKG) 



y^^^KG) 

Claim 2: rj is an eigenvalue of X with the associated eigenvector ^. 
Proof of Claim 2: We consider 



= ^-1/2 

+ C-1/2 



vol(G') vol(G")/ Vvol(G') vol(G") 

D'l VD'D-^D'l D'l rD'D-^U'l 



vol(G') vol(G') vol(G') vol(G") 

D"l V*D"D-^D'l D"l rD"D-^D"l 



vol(G") vol(G') vol(G") vol(G") 

. ,„ D' 
D- 



_ n-i/2 -^'1 frD'D-^{D- D")l VD'D-^D"! 



vol(G') V vol(G') vol(G") 

_i/2 -D"! (l*D"D-^D'l 1*D"D-^{D -D')l 



vol(G") V vol(G') vol(G") 



vol(G') vol(G")/ V Vvol(G') vol(G") 
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as claimed. 

Since X has rank 2 (i.e., it is the sum of two rank one matrices), and we 
have shown that X has eigenvalues 1, 77, then the rest of the eigenvalues are 
0. D 

We now apply Lemma [5] using the fact that the normalized adjacency matrix 
M = D~^/^AD~^/^ has eigenvalues 1 and p = 1 — Ai. Together with Theorem 
[21 we have the following: 

Theorem 5. Suppose G is the union of two graphs G' and G" with degree se- 
quences {d'y) and (d") respectively. Assume both G' and G" satisfy the quasi- 
random property P(e/2). Suppose the normalized Laplacian of G has eigenvalues 
Ai = 1 — Pi, for i = 0, 1, . . . , n — 1 with associated orthonormal eigenvectors (pi. 
Then we have: 



1- Po = 1, 
2. pi satisfies 

-^<i-PiH'>;^i[ „..:?!gLj <e 




vol(G')vol(G"') 

3. \pi\ <efori>l. 

4. The eigenvector <j)i associated with Ai can be written as 

D'l D"l 



D-i/2 



vol(G') vol(G") 



with \\r\\ < e, where D' and D" denote the diagonal degree matrices of G' 
and G" , respectively. 

Theorem 6. Suppose a graphlets fJ is the union of two graphlets fi = f^i U il2 
and Vii are quasi-random graphlets. Then / — A has two nontrivial eigenvalues 
1 and rj where < 77 < 1 satisfies 

where Hi denotes the measure on Q.i. 

Proof. The proof follows immediately from Lemma [S] by substituting Hi{v) = 
d'{v)/vol{G') and ^2(w) = d"{v)/vo\{G") in Lemma [S] and Theorem [S] before 
taking limit as n goes to infinity. D 

In the other direction, we prove the following: 
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Theorem 7. Suppose that the normalized adjacency matrix of a graph G has 
two nontrivial positive eigenvalues 1 and p and the other eigenvalues satisfy 
\Pi\ ^ e for 2 < i < n — 1. Then for each vertex v, the degree dy can be written 
as dy — d'y + dy, with d'y,dy > 0, so that for any subset S of vertices, the number 
E{S) of ordered pairs {u, v), with u,v £ S and {u, v} G E, satisfies 



E{S) 



vol'(S')2 vol"(5) 



Yo\'{G) vol"(G) 



< 2evol(S') 



where vol'(S') = J^y^s ^v and yq)!' {S) = Y^ves ^' 



'II 



Proof. Let 0i,O<i<n— 1, denote the eigenvectors of the normahzed adjacency 
matrix of G. Let (f>Q and (f>\ denote the eigenfunctions associated with p^ — \ 
and p\. 

Since G is connected, the eigenvector (/iq associated with eigenvalue po = 1 
of Mg can be written as 0o = D^l^'^j \/yo\{G) as seen in \\% . The second 
largest eigenvalue p\ is strictly between and 1 because of the connectivity 
of G. Before we proceed to analyze the eigenvector 0i associated with pi, we 
consider the following two vectors which depend on a value a to be specified 
later. 



/i = aD\ - D^I'^^i^Jpi(x{\-(x)Yo\{G) 

/2 = (1 - a)D\ + B^^'^(^x\Jp\Oi{\ - a)vol(G) (38) 

It is easy to verify that /i and /2 satisfy the following: 

/i + /2 = Dl (39) 

^/i(^>) = avol(G), 

V 

Y,f2{v)^{l-a)yo\{G). 

V 

In particular, by considering (/i,-D~^/2}, we see that a satisfies 

1- 1 V h{v)f2{v) , . 

^^ a(l - a)vol(G) ^ dy ' ^ ' 

and we have 



' "(1~«) n-l/2 ffl h 



pivol(G) \a 1 — a 



Claim A: 



^000 + Pi'l>i(, 



j^-l/2f^f*jj-l/2 D-'/^f2fiD-y 



-1/2 



avol(G) (1 - a)vol(G) 
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Proof of Claim A: 
From ((38)) . we have 



avol(G) vol(G) 



Similarly, we have 

(1 - a)vol(G) = (^ - "^ vol(G) + "^^'^^^- 
Combining the above two equalities, Claim A is proved. 
Now, we define two subsets X and Y satisfying 



X 



{- : /i(-) < 0} = {. : d}J^ < M^)]/^^^^ 
Y = {y: f,{v) < 0} = {, : df < -My)sJ^^ 

Clearly X and Y are disjoint. 

Note that when a decreases, the volume of X decreases and the volume of 
Y increases. If a = 1, X consists of all v with (/)i(f) > and Y is empty. For 
a = 0, y consists of all u with (t>i{u) < and X is empty. We choose a so that 

El/i(^)l = E 1/2(2^)1- (42) 

Here we use the convention that a subset X' of X means that there are values 
7i, in {0, 1}, associated each vertex in X with the exception of one vertex with 
a fractional "f^ and the size of X' is the sum of all jyS. 

Now, for each vertex v, we define d'^ and d" as follows: 

r fi{v) iiv^XUY, 
< = <^ iiveX, (43) 

[ di, if i; e F. 

Also, we define d" = d^ — d'^. 
Claim B: 

E < = avol(G) 

V 

E< = (l-«)vol(G) 
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Proof of Claim B: We note that 



xi^X yeY 

xex yeY 

= Ei/i(^)i-E 1/2(2/)! 

xex yeY 

= 0. 

The second equahty can be proved in a similar way that completes the proof of 
Claim B. 

For a subset S of vertices, let xs denote the characteristic function of S 
defined by xs{x) = 1 if a; in 5 and otherwise. We consider 

< x*xD'^^MD'/\y 
< XxD^'^iMl + PiMDD^'^Y + 4D^'\x\\ \W\y\\ 



X*xhfiXY , XxhfiXY 



+ eVvol(X)vol(y). 



(44) 



avol(G) (l-a)vol(G) 

From the definition, we have x*xfi < 0, Xy/i > O7 Xxh > ^'^d Xyh < 
This implies 

X*xhfiXY XxhfiXY 



Vvol(X)vol(y) > -- 



avol(G) (1 - a)vol(G) 



X*xfifiXY 



avol(G) 



X*xf2fiXY 



(1 - a)vol(G) 



mxx\{yol{Y) - fixY) , mxY\{yol{x) - f*,xx) 



avol(G) (1 - a)vol(G) 

l/rxx|(voi(r) + |/2*xi'l) , \fixY\{yoi{x) + \f^xx\) 



> 



avol(G) 

|/i*xxLvoi(r) , voi(x). 



(1 - a)vol(G) 



■(- 



vol(G) ^ a 
by using ((38|) and p9|) . Now, we have 

vol(r) vol(X) 



1-a 



(45) 



a 



1-a 



a J V 1 — a 



> (v/voi(x) + v/voi(y))^ 

> 4v/vol(X)vol(r) 



(46) 
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by using the Cauchy-Schwarz inequality. Combining (|45p and (j46p . we have 

l/rxx| = l/2Xy|<^vol(G). (47) 

Now we consider 

D'JD' D"JD" 

Then, for / = XS: the eharacteristic function of the subset S', we have 

\J 1 J I J J \^ rl' \^ fl" 

ro'jD'j rD"jD"f ^ |ini/2,||2 

T d' T d" +^"^11-^ ■'ll 

^ rhfu rhfu rp'jD'f rD"JD"f g w^^ 

- avol(G) "^ (1 - a)vol(G) E. < E„ < + '"'^ ^ ^ 

^ (/*/i)^ - ird'f , (/V2)^ - (/*d-)^ , ^ 

- ^^;d(G^ + (i-a)voi(G) + ^'^°^^'^^- 

where d' and d" are the degree vectors with entries d'^ and d", respectively. 
Since f = xs, we have 

irfi? - irdr ^ '^E.esnx l/i(^)|vol^(^) + E.eSnx l/i(^)P 
avol(G) - avol(G) 

< 3evol(5) 



Similar inequalities hold for /2 and d". Thus, we have 

{f,Rf)<8evo\{S) 
The proof of Theorem [7] is complete. D 

Theorem 8. Suppose D, is a graphlets and / — A has two nontrivial eigenvalues 
1 and p with < p < 1. Then there is a value a G [0, 1] such that 
(i) n = fii U 172 where /J,{fti) = a and ^{0,2) = 1 — a, 
(a) Qi has a measure fit satisfying 



Pl{x) = p{x) + J- li{x)(J3i{x) 

" 1 — a 



Ii2{x) = t^{x)-\ p,[x)ipi(x), 

V a 

where ipi are orthonormal eigenvectors (under the p,-norm) associated with pi 



35 



The proof of Theorem [5] follows from the proof in Theorem [7] and Lemma [Sj 
Thus, we have (i) 4=> (m). 

Proof of Theorem Rl' We note that in the statement of Theorem |4l the implica- 
tions (ii) <;4> (iv) -i^ (v) follow from the definitions and Lemma [5] It suffices to 
prove (z) <4> (ii) and (in) <^ (iv). 

The implication (ii) => (i) is proved in Theorem [SJ and Theorems [7] and [5] 
implies (i) => (ii). 

To see that (Hi) <^ (*w), we note that if in a graph G„ in the graph sequence, 
the degree sequence dx can be written as dy = d'y + d'l for all v £ V{Gn) where 
<,< > 0, then by defining ^l^;'\v) = </E.<, M^"^(«) = </E.< and 
a = ^^ d'y/^v dv, we have /i„ = fi" + ^r^ ■ Furthermore, we can use the fact 
that 

/(x)(/-A„)5(a;K(.T) = ^^^ (/,(/_ A„)g}^„ 
and (/,1)^,„,(5,1)^(„, = 5] /(«)M")(u) ^ g(«)M(")(«) 

^ fD[,JD':,g 
vol(G„)2 • 



The equivalence of (Jii) and (to) follows from substitutions using the above two 
equations and applying Theorem [2] Theorem |4] is proved. D 

For graphlets of rank 2, there can be a negative eigenvalue —p of / — A in 
addition to the eigenvalue 1. For example, bipartite quasi-random graphlets 
have eigenvalues 1 and —1 for / — A. In general, can such graphlets be charac- 
terized as the union of a quasi-random graphlet and a bipartite quasi-random 
graphlets? To this question, the answer is negative. It is not hard to construct 
examples of a graphlets having three nontrivial eigenvalues which is the union of 
a quasi-random graphlets and a bipartite quasi-random graphlets. With addi- 
tional restrictions on degree distributions and edge density, the three eigenvalues 
can collapse into two eigenvalues. It is possible to apply similar methods as in 
the proof of Theorem ?? to derive the necessary and sufficient conditions for 
such cases but we will not delve into the details here. 



8 Graphlets of rank k 

In this section, we examine graphlets of rank k for some given positive integer k. 
It would be desirable to derive some general characterizations for graphlets of 
rank fc, for example, similar to Theorem HI However, for fc > 3, the situation is 
more complicated. Some of the methods for the case of fc = 2 can be extended 
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but some techniques in the proof of Theorem 3] do not. Here we state a few 
useful facts about graphlets of rank k and leave some discussion in the last 
section. 

Lemma 6. Suppose D is the diagonal degree matrix of a graph G. Suppose that 
for all V in V, dy — X]i=i di(v), for di(v) > 0, 1 < i < k. Let Di denote the 
diagonal matrices with diagonal entries Di(v,v) — di{v). Then the matrix X 
defined by 

has k nonzero eigenvalues rji where rji are eigenvalues of a k x k matrix M 
defined by 

d,{v)dj{v) 



M{z,j) = J2- 



dv 



Furthermore, the eigenvector ^i for X which is associated with eigenvalue rji can 
be written as 

k J f ^ .-1/2 

i=i ■' 

where ipi are eigenvectors of M associated with eigenvalues rji. 

Proof. The proof of Lemma [5] is by straightforward verification. Under the 
assumption that ipjM = rjjipi for 1 < i < fc, it suffices to check that S.iX = rji^ 
for ^i. The proof is done by direct substitution and will be omitted. D 

Theorem 9. // a graphlets fl is the union of k quasi-random graphlets, 

fi = fii U f72 U ... U fife 

then the Laplace operator A satisfies the property that / — A has k nontrivial 
positive eigenvalues. 

The proof of Theorem |9] follows immediately from Lemma [6] 

Several questions follow the above theorem. If / — A has k eigenvalues 
that are not necessarily positive, is it possible to decompose Q, into a number 
of quasi-random graphlets or bipartite quasi-random graphlets? Under what 
additional conditions can such decompositions exist? If they exist, are they 
unique? Numerous additional questions can be asked here. 



9 Concluding remarks 

In this paper, we have merely scatched the surface of the study of graphlets. 
Numerous questions remain, some of which we mention here. 
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(1) In this paper, we mainly study quasi-random graphlets and graphlets of fi- 
nite rank (which are basicahy 'sums' of quasi-random graphlets). It will be quite 
essential to understand other families of graph sequences, such as the graph se- 
quences of paths, cycles, trees, grids, planar graphs, etc. In this paper, we define 
the spectral distance between two graphs as the spectral norm of the 'difference' 
of the associated Laplacians. In a subsequent paper, we consider a generalized 
version of spectral distance for considering large families of graphlets. 

(2) We here use [0, 1] as the labels for the graphlets and the measure fi of the 
graphlets depends on the Lebesgue measure on [0, 1]. To fully understand the 
geometry of graphlets derived from general graph sequences, it seems essential 
to consider general mesurable spaces as labeling spaces. For example, for graph 
sequences C„ x C„, it works better to use [0, 1] x [0, 1] as the labeling space, 
instead. 

(3) In this paper we relate the spectral distance to the previously studied cut- 
distance by showing the equivalence of the two distance measures for graph 
sequences of any degree distribution. It will be of interest to find and to relate 
to other distances. For example, will some nontrivial subgraph count measures 
be implied by the spectral distance (see the questions and remarks mentioned 
in Section [S])? 

(4) In the study of complex graphs motivated by numerous real- world networks, 
random graphs are often utilized for analyzing various models of networks. In- 
stead of using the classical Erdos-Renyi model, for which graphs have the same 
expected degree for every vertex, the graphs under consideration usually have 
prescribed degree distributions, such as a power law degree distribution. For 
example, for a given expected degree sequence w = (dy), ior v e V, a random 
graph G'(w) has edges between u and v with probability pdudy, for some scaling 
constant (see (20)). Such random graphs are basically quasi-random of positive 
rank one. Nevertheless, realistic networks often are clustered or have uneven 
distributions. A natural problem of interest is to identify the clusters or 'local 
communities'. The study of graphlets of rank two or higher can be regarded 
as extensions of the previous models. Indeed, the geometry of the graphlets 
can be used to illustrate the limiting behavior of large complex networks. In 
the other direction, network properties that are ubiquitous in many examples 
of real-world graphs can be a rich source for new directions in graphlets. 

(5) Although we consider undirected graphs here, some of these questions can 
be extended to directed graphs. In this paper, we focus on the spectral distance 
of graphs but for directed graphs the spectral gaps can be exponentially small 
and any diffusion process on directed graphs can have very different behavior. 
The treatment for directed graphs will need to take these considerations into 
account. Many questions remains. 
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